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SOLVING  GEOMETRIC  PROGRAMS  USING  GRG-RESULTS  AND  COMPARISONS 

By 


M.  Ratner,  L.  S.  Lasdon  and  A.  Jain 


Introduction 

Thin  paper  describes  the  performance  of  a generalized  reduced 
gradient  (GRG)  algorithm  in  solving  geometric  programs.  The  code  used, 
described  in  [5],  is  a general  purpose  nonlinear  programming  code,  and 
takes  no  advantage  of  the  structure  of  geometric  programs.  First  partial 
derivatives  of  the  objective  and  all  constraint  functions  are  required, 
and  these  are  computed  by  simple  forward  difference  approximations.  All 
problem  functions  are  expressed  in  power  form,  i.e.,  each  term,  t^,  has 
the  form 


Problems  Solved  and  Measures  of  Comparison 

rnhe  geometric  programs  solved  come  from  two  sources:  H problems 

given  by  D'  nbo  in  f L ] and  the  P4  problems  of  Rijekaert  and  Martens  in  [•']. 
Problem  sizes  are  given  in  Table  1 below.  The  problems  are  good  examples 
of  small,  de^ce,  highly  nonlinear  NLP's.  The  problems  with  some  negative 
terms  are  generalized  geometric  programs  with  signomial  constraints. 
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1 TABLE  1 


Problem  Size 


No.  of 

No.  of 

No.  of 

No.  of 

No.  of 

positive 

negative 

binding  constraints 

Problem 

variables 

constraints 

terms 

terms 

at  optimality 

D1 

12 

3 

31 

0 

| 

3 ! 

D2 

5 

6 

15 

8 

2 

D3 

7 

14 

31 

13 

5 

d4a,b 

8 

It 

14 

2 

4 

D4C 

8 

5 

16 

0 

5 

D5 

8 

6 

14 

5 

6 

D6 

13 

13 

27 

12 

11 

D7 

16 

19 

40 

21 

*■  *"  ! 

D8A,B, 

C 7 

4 

18 

0 

A;  2,  B:  3,  C:  4 

These  may  have  local  optima  which  are  not  global  (such  a point  was 
encountered  in  at  least  one  problem). 


Measures  of  Comparison 

In  comparing  GRG  with  the  code  used  by  Dembo  in  [2]  (one  of  the 
better  special  purpose  GP  codes)  two  measures  were  available--the  final 
objective  -value  obtained  and  the  "standard  time"  required  to  achieve  that 
value.  Standard  time  is  the  execution  time  for  the  problem  divided  by 
the  time  to  execute  a timing  program  written  by  Colville  [ 1 ] . This 
program  inverts  a 40  by  40  matrix  10  times.  Use  of  standard  time  is 
supposed  to  compensate  for  the  effects  of  different  computing  environments, 
e.g.,  machines,  compilers,  etc.  To  investigate  this  we  solved  4 problems 
on  the  IBM  3To/l68  at  Stanford  University  using  three  different  FORTRAN 
compilers:  the  FORTRAN  H compiler  (0PT=2 ) , the  WATFIV  compiler  with  the 

CHECK  option  and  the  WATFIV  compiler  with  the  NOCHECK  option.  The  results 
appear  in  Table  2,  which  gives  the  times  required  by  GRG  to  solve  four 
problems  (with  minimal  printed  output)  divided  by  the  time  required  to 
run  the  timing  program.  There  is  great  variation  in  standard  times  between 
the  three  compilers,  with  widest  variation  (by  factors  of  from  3 to  10' 
between  WATFIV  (CHECK)  and  the  FORTRAN  H compilers.  Evidently  this 
naive  way  of  compensating  for  computing  environment  is  inadequate. 

To  compare  with  the  other  published  results,  we  chose  the  WATFIV 
NOCHECK  compiler,  partly  for  convenience,  partly  because  it  gave  the 
median  times.  In  all  GRG  runs  there  was  no  printing  of  intermediate 
results,  but  input  data  and  final  results  were  printed.  In 
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problems  with  run  times  less  than  1 second,  even  this  printing  may 
consume  a large  fraction  of  total  time. 

Comparison  with  the  Rijckaert  and  Martens  results  is  difficult, 
since  their  starting  points  were  chosen  randomly,  and  were  not  published. 
We  chose  our  starting  valuer  so  that  odd -subs erupted  variables  were 
one-half  their  optimal  value,  and  even-subscripted  variables  were 
three-halves  their  optimal  value.  The  resulting  points  are  shown  in 
Appendix  A. 


Computational  Results 

Table  3 shows  the  performance  of  GRG  on  the  Dernbo  problems  on 
our  first  attempt.  Problem  1A  was  too  badly  scaled  to  attempt  solution, 
and  the  code  failed  on  Problems  3,  6 and  7.  In  Problems  3 and  GRG 
terminated  prematurely  when  no  decrease  in  the  objective  was  achieved 
while  attempting  to  move  in  the  direction  of  steepest  descent,  while 
in  Problem  7 the  program  terminated  short  of  feasibility  at  a local 
optimum  of  the  Phase  I objective. 

Improved  results  were  obtained  by  using  an  alternative  pivoting 
strategy  in  computing  the  basis  inverse.  This  strategy  allowed  pivoting 
on  matrix  elements  smaller  than  allowed  by  the  previous  strategy  if  the 
alternative  was  en’.ering  a variable  at.  a bound  into  the  basis.  This 
avoided  degenerate  bases  in  some  cases,  and  allowed  solution  of  problem 
3 and  improved  performance  on  number  3 (see  Table  4). 
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Computational  Results  for  Dembo  GP  Problems,  Using  Specified  Constraint  Tolerances 
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Table  5 shows  the  effects  of  another  modification  to  GRG. 

The  code  uses  the  BFS  variable  metric  method  to  minimize  the  reduced 
objective.  The  original  strategy  was  to  update  the  approximation,  H, 
to  the  inverse  hessian  used  by  this  method  only  when  the  line  search 
terminated  in  an  unconstrained  optimum.  Otherwise  it  was  reset  to  the 
identity,  and  the  search  direction  became  the  negative  reduced  gradient. 
The  new  strategy  used  the  BFS  update  at  each  iteration,  except  those 
at  which  a basis  change  occurred.  In  the  5 problems  of  Table  5,  this 
new  strategy  was  better  in  all  problems  but  one,  significantly  better 
in  3 problems. 

Some  Dembo  problems  (whose  feasibility  tolerance  specified  was 

-k, 

tighter  than  10  J were  re-run  using  the  default  feasibility  tolerance 
(10‘4)  of  the  GRG  code.  As  shown  in  Table  6,  solutions  wer  obtained 
faster  than  with  the  specified  tolerances  (Table  3)*  This  prompted 
the  use  of  a coarse  tolerance  to  obtain  an  initial  solution,  followed 
by  a refinement  using  the  specified  tolerances.  As  shown  in  Table  7, 
this  strategy  yielded  a significant  decrease  in  computational  effort 
for  Problems  8A,  SB  and  8C. 

The  performance  of  GRG  on  the  Rijckaert -Martens  problems  is 
shown  in  Table  8.  The  column  "Reported  S.T."  contains  the  best 
standard  time  reported  by  Rijckaert  and  Martens  [6]  in  a comparison 
of  eleven  special  purpose  codes  for  geometric  programming  and  one  general 
purpose  code.  GRG  was  generally  slower  than  the  best  code  and  missed  the 
true  optimum  by  one  to  two  percent  in  Problems  8,  13  and  15.  Otherwise, 
GRG  solved  all  these  problems  satisfactorily. 
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Computational  Results  for  Rijckaert  and  Martens  Gr  Problems 
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Problem  23  was  the  same  as  Dembo  #2  so  it  was  not  rerun;  Problem  20  had  an  unresolved  typographical  error. 

The  feasibility  tolerance  used  was  10“^  in  contrast  to  the  stricter  tolerance  of  10  5 used 
by  P.i.ickaert  and  Martens.  This  difference  would  tend  to  bias  results  in  favor  of  GRG. 

« . d 

Tolerance  controlling  termination  had  to  be  tightened  by  a power  of  10. 


Note  that  li-<:  is  competitive  or  superi cr  lu  i * s s i.aridard  ‘ ice 
on  the  larger  prchl  ems , 1'J  thru  ; it . ..  ince  all  times  oxct-p;  * w are 

on  the  order  of  1 second,  the  printing  of  runic  output  by  ;P -,wn • eh 
may  consume  a large  fraction  of  run  : Line  in  these  cases  and  ■.< 
previously  mentioned  difficulties  with  using  s'  andarl  times,  -'..ply  that 
these  comparisons  must  be  taken  with  a large  grain  of  salt. 

An  enhancement  of  the  OKI  cole,  described  in  [ ij , ares  ;uadrati  ; 
extrapolation  to  compute  initial  estimates  of  basic  variables  prior  to 
solution  of  the  nonlinear  constraint  equations  in  contrast  to  tangent 
vector  extrapolation  ; h]  used  in  the  runs  described  above.  Some  of  the 
Dembo  ard  hi.jekaert -Martens  problems  were  used  in  tests  to  compare  the 
two  extrapolation  schemes.  The  results,  displayed  in  Tables  ‘)  and  10, 
(which  exhibit  minor  discrepancies  with  the  results  in  Tables  3-b  owing 
to  minor  differences  in  tolerances  and  strategies  used)  show  the 
superiority  of  quadratic  extrapolation  for  these  problems. 


Conclusions 

Conclusions  to  be  'irawn  from  these  experiments  are; 

1.  ''Standard  time',"'  as  defined  by  Colville  in  II],  is  an  inadequate 
means  of  compensating  for  different,  computing  environments  when 
attempting  to  compare  optimisation  algorithms.  Improved  procedures 
are  needed. 

>Rf , representing  the  class  of  general  purpose  NLP  algorithms,  competes 
well  with  special  purpose  geometric  programming  codes  in  solving 
geometric  programs. 
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^j.  if.  -Wk&si  : 


3.  Certain  modifications  in  solution  strategy  can  strongly  affect  the 

performance  of  GRG.  Among  these  are:  when  the  approximate  hessian 

is  reset,  the  logic  used  in  basis  inversion  to  decide  when  a variable 
at  bound  is  to  enter  the  basis,  and  the  order  of  extrapolation 
(linear  or  quadratic)  used  to  obtain  initial  estimates  of  the 

basic  variables. 

4.  Certain  parameter  settings  strongly  affect  GRG  performance:  in 

particular,  the  tolerance  used  to  determine  which  constraints  are 
binding,  and  the  tolerance  used  to  terminate  the  algorithm. 

In  closing,  we  note  some  things  left  undone  but  worth  doing.  GRG 
could  easily  be  made  more  convenient  and  efficient  on  geometric  programs 
by  coding  a special  subroutine  to  compute  first  partial  derivatives.  This 
would  use  the  fact,  that  if  the  ith  term  in  the  program  is 


then 


t. 

1 


n 


n 

j=l 


dt. 
i 

3x, 


a. , t. 
lk  i 


5 

I 


\ 


1 

1 

\ 


l 


s 


t 
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Hence,  if  the  terms  are  stored  when  computing  the  constraint  and  objective 
values,  their  partial  derivatives  are  available  with  little  additional 
effort.  This  would  reduce  the  time  required  to  compute  the  gradient 
of  a function  from  the  time  required  in  these  runs  Int^.,  where  t^,  is 
the  time  required  to  evaluate  the  function  and  n is  the  number  of 
variables)  to  little  more  than  t^,.  Special  input  subroutines  could 
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be  coded  to  enable  the  user  to  specify  the  problem  by  inputting  only 

(a)  the  constants  c.  , (b)  the  exponent  matrix  a.  . and  (e  which 

1 xj 

terms  appear  in  which  problem  functions.  Currently,  all  problem  functions 
must  be  coded  directly.  These  enhancements  would  transform  GhJ  into  a 
"special  purpose"  geometric  programming  code. 

Some  additional  experiments  appear  useful.  Geometric  programs 
can  be  transformed  into  exponential  form  by  the  change  of  variables 


which  transforms  the  ith  term  into 

t.  = c.  expel,  a.  .y . ) 
x x . i/o 

Evaluation  of  t^  then  requires  only  one  transcendental  computation 

rather  than  one  for  each  fractional  a. ..  In  addition,  y.  is  a free 

xj  .1 

variable  (if  x.  has  no  upper  bound),  and  the  problem  functions  become 
convex  if  all  c^  are  positive.  Some  problems  should  be  solved  using 
both  forms,  to  see  which  yields  smallest  solution  times.  In  addition, 
tests  of  GRG  and  some  good  geometric  programming  codes  should  be  run  on 
the  same  computer,  in  order  to  remove  the  factor  of  standard  times  from 
obscuring  the  comparisons. 
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APPENDIX  A 
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Rijckaert -Martens  Problems  in  GR1  Runs 
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